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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT 

(US only): ANTALIS Toni Marie and HOOPER John David 
(Other than US): AMRAD OPERATIONS PTY LTD 

(ii) TITLE OF INVENTION: NOVEL MOLECULES 

(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Application 

(B) FILING DATE: 13-FEB- 1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PO5101/97 

(B) FILING DATE: 13-FEB-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PP0422/97 

(B) FILING DATE: 18-NOV-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: International PCT Application 

(B) FILING DATE: 13-FEB-1998 

(C) CLASSIFICATION: 
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(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: DIGIGLIO. FRANK S 

(B) REGISTRATION NO: 31,346 

(C) REFERENCE/DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (5 1 6) 742 4343 . 

(B) TELEFAX: (516) 742 4366 

(C) TELEX: 230 901 SANS UR 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



ACAGAATTCT GGGTIGTIAC IGCIGCICAY TG 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
ACAGAATTCA XIGGICCICC IC/GT/AXTCICC 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 17. .965 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
1 5 10 



49 



a 
a 



r 3 ? 



CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 97 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTG 145 
Pro Leu Ser Gly Pro Cys Gly Arg Arg Val lie Thr Ser Arg lie Val 
30 35 40 

GGT GGA GAG GAC GCC GAA CTC GGG CGT TCG CCG TGG CAG GGG AGC. CTG 193 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 



CGC CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 75 



241 



TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACT GAC CTT AGT GAT CCC 
Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro 
80 35 90 



289 



TCC GGG TGG ATG GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA TCC TTC 
Ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe 
95 100 105 



337 



TGG AGC CTG CAG GCC TAC TAC ACC CGT TAC TTC GTA TCG AAT ATC TAT 
Trp Ser Leu Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn He Tyr 
110 115 120 



385 



CTG AGC CCT CGC TAC CTG GGG AAT TCA CCC TAT GAC ATT GCC TTG GTG 
Leu Ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val 
125 130 135 



433 



AAG CTG TCT GCA CCT GTC ACC TAC ACT AAA CAC ATC CAG CCC ATC TGT 
Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His He Gin Pro He Cys 
140 145 ISO 155 



481 



CTC CAG GCC TCC AC A TTT GAG TTT GAG AAC CGG ACA GAC TGC TGG GTG 
Leu Gin Ala Ser Thr Phe ciu Phe Glu Asn Arg Thr Asp Cys Trp Val 
160 165 170 



529 
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ACT GGC TGG GGG TAC ATC AAA GAG GAT GAG GCA CTG CCA TCT CCC CAC 
Thr Gly Trp Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser Pro His 
175 180 185 

ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG TGC AAC 
Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn 
190 195 200 

CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA GAC ATG 
His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp He Phe Gly Asp Met 
205 210 215 

GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC GGT GAC 
Val Cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp 
220 225 230 235 

TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT CAG ATT 
Ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He 
240 245 250 

GGA GTC GTG AGC TGG GGA GTG GGC TGT GGT CGG CCC AAT CGG CCC GGT 
Gly Val Val Ser Trp Gly Val Gly Cye Gly Arg Pro Asn Arg Pro Gly 
255 260 265 

GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TGG ATC CAG AAG CTG ATG 
Val Tyr Thr Asn He. Ser His His Phe Glu Trp He Gin Lys Leu Met 
270 275 280 

GCC CAG AGT GGC ATG TCC CAG CCA GAC CCC TCC TGG CCG CTA CTC TTT 
Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe 
2S5 290 295 

TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TGA 
Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Gly Pro Val * 
300 305 310 

GCCTACCTGA GCCCATGCAG CCTGGGGCCA CTGCCAAGTC AGGCCCTGGT TCTCTTCTGT 
CTTGTTTGGT AATAAACACA TTCCAGTTGA TGCCTTGCAG GGCATTTTTC AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO : 4 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
15 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val He Thr Ser Arg He Val Gly Gly Glu Asp Ala 
35 40 45 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 

His Val Cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 70 75 80 

Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro Ser Gly Trp Met Val 
85 90 95 

Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu Gin Ala 
100 105 HO 

Tyr Tyr Thr Arg Tyr Phe Val Ser Asn He Tyr Leu Ser Pro Arg Tyr 
115 120 125 

Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val Lys Leu Ser Ala Pro 
130 135 140 

Val Thr Tyr Thr Lys His He Gin Pro He Cys Leu Gin Ala Ser Thr 
i45 150 155 160 

Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp. Val Thr Gly Trp Gly Tyr 
165 170 175 



He Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin Glu Val 
180 185 190 
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Gin Val Ala He lie Asn Asn Ser Met Cys Asn His Leu Phe Leu Lys 
195 200 205 

Tyr Ser Phe Arg Lys Asp He Phe Gly Asp Met Val Cys Ala Gly Asn 
210 215 220 

Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp Ser Gly Gly Pro Leu 
225 230 - 235 240 

Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He Gly Val Val Ser Trp 
245 250 255 

Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr Thr Asn He 
260 265 270 

Ser His His Phe Glu Trp He Gin Lys Leu Met Ala Gin Ser Gly Met 
275 230 285 

Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu Leu Trp 
290 295 300 



Ala Leu Pro Leu Leu Gly Pro Val * 
305 310 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 17.. 961 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
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CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTG 
Pro Leu Ser Gly Pro Cys Gly Arg Arg Val lie Thr Ser Arg lie Val 
30 35 40 

GGT GGA GAG GAC GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 

CGC CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 75 

TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACC TAT AGT GAC CTT AGT 
Trp Ala Leu Thr Ala Ala Kis Cys Phe Glu Thr. Tyr Ser Asp Leu Ser 
80 35 90 

GAT CCC TCC GGG TGG ATG GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA 
Asp Pro Ser Gly Trp Mec Val Gin Phe Gly Gin Leu Thr Ser Met Pro 
95 100 105 

TCC TTC TGG AGC CTG CAG GCC TAC TAC ACC CGT TAC TTC GTA TCG AAT 
Ser Phe Trp Ser Leu Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn 
110 115 120 

ATC TAT CTG AGC CCT CGC TAC CTG GGG AAT TCA CCC TAT GAC ATT GCC 
lie Tyr Leu Ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp lie Ala 
125 130 135 

TTG GTG AAG CTG TCT GCA CCT GTC ACC TAC ACT AAA CAC ATC CAG CCC 
Leu Val Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His lie Gin Pro 
140 145 150 155 



ATC TGT CTC CAG GCC TCC ACA TTT GAG TTT GAG AAC CGG ACA GAC TGC 
He Cys Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys 
160 165 170 



TGG GTG ACT GGC TGG GGG TAC ATC AAA GAG GAT GAG GCA CTG CCA TCT 
Trp Val Thr Gly Trp Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser 
175 180 135 
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CCC CAC ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG 625 
Pro His Thr Leu Gin Glu Val Gin Val Ala lie lie Asn Asn Ser Met 
190 195 200 

TGC AAC CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA 673 
Cys Asn His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp He Phe Gly 
205 210 21S 

GAC ATG GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC 721 
Asp Met val Cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe 
220 225 230 235 

GGT GAC TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT 769 
Gly Asp Ser Gly. Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr 
240 245 250 

CAG ATT GGA GTC GTG AGC TGG GGA ( GTG GGC TGT GGT CGG CCC AAT CGG 817 
. Gin He Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg 
255 260 265 

CCC GGT GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TGG ATC CAG AAG 8 65 

Pro Gly Val Tyr Thr Asn He Ser His His Phe Glu Trp He Gin Lys 
270 275 230 

CTG ATG GCC CAG AGT GGC ATG TCC CAG CCA GAC CCC TCC TGG CCG CTA 913 
Leu Met Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu 
285 290 295 

CTC TTT TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TGAGCCTACC 
968 

Leu Phe Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
300 305 310 315 

TG AGC CC ATG CAGCCTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC TGTCTTGTTT 1028 

GGTAATAAAC ACATTCCAGT TGATGCCTTG CAGGGCATTT TTCAAAAAAA AAAAAAAAAA 1088 

AAAAAAAAAA AA 1100 



(2| INFORMATION FOR SSQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Mec Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
1 5 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val lie Thr Ser Arg He val Gly Gly Glu Asp Ala- 
35 40 45-. 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 5 5 60 

His Val Cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 70 75 80 

Ala His Cys Phe Glu Thr Tyr Ser Asp Leu Ser Asp Pro Ser Gly Trp 
85 90 95 

Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu 
100 105 110 

Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn lie Tyr Leu Ser Pro 
115 120 125 

Arg Tyr Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val Lys Leu Ser 
130 135 140 

Ala Pro Val Thr Tyr Thr Lys His He Gin Pro He Cys Leu Gin Ala 
145 150 155 160 

Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val Thr Gly Trp 
165 170 175 

Gly Tyr lie Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin 
130 155 190 



Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn His Leu Phe 
195 200 205 



f»*Of8R\EJH\PO5101.p.0Al • WW* 



-66- 

Leu Lys Tyr Ser Phe Arg Lys Asp He Phe Gly Asp Met Val Cys Ala' 
210 215 220 

Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp Ser Gly Gly 
225 230 235 240 

Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He Gly Val Val 
245 250 255 

Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr Thr 
260 265 270 

Asn He Ser His His Phe Glu Trp He Gin Lys Leu Met Ala Gin Ser 
275 280 285 

Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu 
290 295 300 

Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
305 310 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 24.. 79 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
AGTTCAGATG AATGGGACTG TGA GAA CCA TCT GTG ACC AAA TTG ATA CAG 

Glu pro Ser Val Thr Lys Leu He Gin 
1 5 

GAA CAG GAG AAA GAG CCG CGG TGG CTG AC A TTA CAC TCC AAC TGG GAG 
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Glu Gin Glu Lys Glu Pro Arg Trp Leu Thr Leu His Ser Asn Trp Glu 
10 15 20 25 

AGC CTC AAT GGG ACC ACT TTA CAT GAA CTT GTA GTA AAT GGG CAG TCT 
Ser Leu Asn Gly Thr Thr Leu His Glu Leu Val Val Asn Gly Gin Ser 
30 35 40 

TGT GAG AGC AGA AGT AAA ATT TCT CTT CTG TGT ACT AAA CAA GAC TGT 
Cys Glu Ser Arg Ser Lys lie Ser Leu Leu Cys Thr Lys Gin Asp Cys 
45 50 55 

GGG CGC CGC CCT GCT GCC CGA ATG AAC AAA AGG ATC CTT GGA GGT CGG 
Gly Arg Arg Pro Ala Ala Arg Met Asn Lys Arg He Leu Gly Gly Arg 
60 65 70 . 

ACG AGT CGC CCT GGA AGG TGG CCA TGG CAG TGT TCT CTG CAG AGT GAA 
Thr Ser Arg Pro Gly Arg Trp Pro Trp Gin Cys Ser Leu Gin Ser Glu 
75 80 85 

CCC AGT GGA CAT ATC TGT GGC TGT GTC CTC ATT GCC AAG AAG TGG GTT 
Pro Ser Gly His He Cys Gly Cys Val Leu lie Ala Lys Lys Trp Val 
90 95 100 105 

GTG ACA GTT GCC CAC TGC TTC GAG GGG AGA GAG AAT GCT GCA GTT TGG 
Val Thr Val Ala His Cys Phe Glu Gly Arg Glu Asn Ala Ala Val Trp 
110 115 120 

AAA GTG GTG CTT GGC ATC AAC AAT CTA GAC CAT CCA TCA GTG TTC ATG 
Lys Val Val Leu Gly He Asn Asn Leu Asp His Pro Ser Val Phe Met 
125 130 135 

CAG ACA CGC TTT GTG AGG ACC ATC ATC CTG CAT CCC CGC TAC AGT CGA 
Gin Thr Arg Phe Val Arg Thr He He Leu His Pro Arg Tyr Ser Arg 
140 145 150 

GCA GTG GTG GAC TAT GAC ATC AGC ATC GTT GAG CTG AGT GAA GAC ATC 
Ala Val Val Asp Tyr Asp He Ser He Val Glu Leu Ser Glu Asp He 
155 160 165 



AGT GAG ACT GGC TAC GTC CGG CCT GTC TGC TTG CCC AAC CCG GAG CAG 
Ser Glu Thr Gly Tyr Val Arg Pro Val Cys Leu Pro Asn Pro Glu Gin 
170 175 180 195 



TGG CTA GAG CCT GAC ACG TAC TGC TAT ATC ACA GGC TGG GGC CAC ATG 
Trp Leu Glu Pro Asp Thr Tyr Cys Tyr He Thr Gly Trp Gly His Met 
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190 195 200 

GGC AAT AAA ATG CCA TTT AAG CTG CAA GAG GGA GAG GTC CGC ATT ATT 
Gly Asn Lys Met Pro Phe Lys Leu Gin Glu Gly Glu Val Arg He He 
205 210 215 



TCT CTG GAA CAT TGT CAG TCC TAC TTT GAC ATG AAG ACC ATC ACC ACT 
Ser Leu Glu His Cys Gin Ser Tyr Phe Asp Mec Lys Thr He Thr Thr 
220 225 230 



CGG ATG ATA TGT GCT GGC TAT GAG TCT GGC ACA GTT GAT TCA TGC ATG 
Arg Mec He Cys Ala Gly Tyr Glu Ser Gly Thr Val Asp Ser Cys Met 
235 240 245 

GGT GAC TGG GGC GGT CCG TTG AAT TCT GT 
Gly Asp Trp Gly Gly Pro Leu Asn Ser 
250 255 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Glu Pro Ser Val Thr Lys Leu He Gin Glu Gin Glu Lys Glu Pro Arg 
15 10 is 



Trp Leu Thr Leu His Ser Asn Trp Glu Ser Leu Asn Gly Thr Thr Leu 
20 25 30 

His Glu Leu Val Val Asn Gly Gin Ser Cys Glu Ser Arg Ser Lys He 
35 40 45 

Ser Leu Leu Cys Thr Lys Gin Asp Cys Gly Arg Arg Pro Ala Ala Arg 
50 55. 60 



Mec Asn Lys Arg He Leu Gly Gly Arg Thr Ser Arg Pro Gly Arg Trp 
65 70 75 80 
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Pro Trp Gin Cys Ser Leu Gin Ser Glu Pro Ser Gly His lie Cys Gly 
85 90 95 

Cys Val Leu lie Ala Lys Lys Trp Val Val Thr Val Ala His Cys Phe 
100 105 110 

Glu Gly Arg Glu Asn Ala Ala Val Trp Lys Val Val Leu Gly He Asn 
115 120 125 



Q 

a 



ri 
Us 



Asn Leu Asp His Pro Ser Val Phe Met Gin Thr Arg Phe Val Arg Thr 
130 135 140 

He He Leu His Pro Arg Tyr Ser Arg Ala Val Val Asp Tyr Asp He 
145 150 155 160 

Ser He Val Glu Leu Ser Glu Asp He Ser Glu Thr Gly Tyr Val Arg 
165 170 175 



Q 
u 

B 

D 

SI 



Pro val Cys Leu Pro Asn Pro Glu Gin Trp Leu Glu Pro Asp Thr Tyr 
180 185 190 

Cys Tyr He Thr Gly Trp Gly His Met Gly Asn Lys Met Pro Phe Lys 
195 200 205 

Leu Gin Glu Gly Glu Val Arg He He Ser Leu Glu His Cys Gin Ser 
210 215 220 

Tyr Phe Asp Met Lys Thr He Thr Thr Arg Met He Cys Ala Gly Tyr 
225 230 235 240 

Glu Ser Gly Thr Val Asp Ser Cys Met Gly Asp Trp Gly Gly Pro Leu 
245 250 255 



Asn Ser 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



• # 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 166. .1773 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ATTTAATACG ACTCACTATA GGGAATTTGG CCCTCGAGGA ACAATTCGGC ACGAGGCTGC 60 

I" GGCGC ACTGT GAGGGAGTCG CTGTGATCCG GGGCCCCGAA CCCGACTGGA GCTGAAGCGC 120 

??A 

'II AGGCTGCGGG GCGCGGAGTC GGGAGGCCTG AGTGTTCCTT CCAGC ATG TCG GAG 
!:! Met Ser Glu 

n 

=?; GGG GAG TCC CAG ACA GTA CTT AGC AGT GGC TCA GAC CCA AAG GTA GAA 

Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro Lys Val Glu 
5 5 10 15 

9 

M TCT ^A TCT TCA GCT CCT GGC CTG ACA TCA GTG TCA CCT CCT GTG ACC 

O Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro Pro Val Thr 

M 20 25 • 30 35 

Q 

fll TCC ACA ACC TCA GCT GCT TCC CCA GAG GAA GAA GAA GAA AGT GAA GAT 

Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu Ser Glu Asp 
40 45 50 

GAG TCT GAG ATT TTG GAA GAG TCG CCC TGT GGG CGC TGG CAG AAG AGG 
Glu Ser Glu lie Leu Glu Glu Ser Pro Cys Gly Arg Trp Gin Lys Arg 
55 60 65 

CGA GAA GAG GTG AAT CAA CGG AAT GTA CCA GGT ATT GAC AGT GCA TAC 
Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly He Asp Ser Ala Tyr 
70 "75 80 

CTG GCC ATG GAT ACA GAG GAA GGT GTA GAG GTT GTG TGG AAT GAG GTA 
Leu Ala Met Asp Thr Glu Glu Gly Val Glu Veil Val Trp Asn Glu Val 
8S 90 95 

CAG TTC TCT GAA CGC AAG AAC TAC AAG CTG CAG GAG GAA AAG GTT TGT 
Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu Lys Val Cys 
1°° 105 110 115 



174 



222 



270 



318 



366 



414 



462 



510 



GCT GTG TTT GAT AAT TTG ATT CAA TTG GAG CAT CTT AAC ATT GTT AAG 



553 
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Ala val Phe Asp Asn Leu He Gin Leu Glu His Leu Asn He Val Lys 
120 125 130 

TTT CAC AAA TAT TGG GCT GAC ATT AAA GAG AAC AAG GCC AGG GTC ATT 
Phe His Lys Tyr Trp Ala Asp He Lys Glu Asn Lys Ala Arg Val He 
135 140 145 

TT'T ATC ACA GGA TAC ATG TCA TCT GGG AGT CTG AAG CAA TTT CTG AAG 
Phe He Thr Gly Tyr Met Ser Ser Gly Ser Leu Lys Gin Phe Leu Lys 
150 155 160 

AAG ACC CAA AAG AAC CAC CAG ACG ATG AAT GAA AAG GCA TGG AAG CGT 
Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala Trp Lys Arg 
165 170 175 

TGG TGC ACA CAA ATC CTC TCT GCC CTA AGC TAC CTG CAC TCC TGT GAC 
Trp Cys Thr Gin He Leu Ser Ala Leu Ser Tyr Leu His Ser Cys Asp 
180 185 190 195 

CCC CCC ATC ATC CAT GGG AAC CTG ACC TGT GAC ACC ATC TTC ATC CAG 
Pro Pro He He His Gly Asn Leu Thr Cys Asp Thr He Phe He Gin 
200 205 210 

CAC AAC GGA CTC ATC AAG ATT GGC TCT GTG GCT CCT GAC ACT ATC AAC 
His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp Thr He Asn 
215 220 225 

AAT CAT GTG AAG ACT TGT CGA GAA GAG CAG AAG AAT CTA CAC TTC TTT 
Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu His Phe Phe 
230 235 240 

GCA CCA GAG TAT GGA GAA GTC ACT AAT GTG ACA ACA GCA GTG GAC ATC 
Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala Val Asp He 
245 250 255 

TAC TCC TTT GGC ATG TGT GCA CTG GGG ATG GCA GTG CTG GAG ATT CAG 
Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu Glu He Gin 
260 265 270 275 

GGC AAT GGA GAG TCC TCA TAT GTG CCA CAG GAA GCC ATC AGC AGT GCC 
Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala He Ser Ser Ala 
280 285 290 

ATC CAG CTT CTA GAA GAC CCA TTA CAG AGG GAG TTC ATT CAA AAG TGC 
He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe He Gin Lys Cys 




fri 



s 

a 

D 
?3I 



72- 



295 300 305 

CTG CAG TCT GAG CCT GCT CGC AGA CCA ACA GCC AGA GAA CTT CTG TTC 1134 
Leu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu Leu Leu Phe 
310 315 320 

CAC CCA GCA TTG TTT GAA GTG CCC TCG CTC AAA CTC CTT GCG GCC CAC 1182 
His Pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu Ala Ala His 
325 330 335 

TGC ATT GTG GGA CAC CAA CAC ATG ATC CCA GAG AAC GCT CTA GAG GAG 1230 
Cys He Val Gly His Gin His Met He Pro Glu Asn Ala Leu Glu Glu 
340 345 350 355 

ATC ACC AAA AAC ATG GAT ACT AGT GCC GTA CTG GCT GAA ATC CCT GCA 127 8 

He Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu He Pro Ala 
360 365 370 

GGA CCA GGA AGA GAA CCA GTT CAG ACT TTG TAC TCT CAG TCA CCA GCT 1326 
Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin Ser Pro Ala 
375 380 385 

CTG GAA TTA GAT AAA TTC CTT GAA GAT GTC AGG AAT GGG ATC TAT CCT 1374 
Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly He Tyr Pro 
390 395 400 

CTG ACA GCC TTT GGG CTG CCT CGG CCC CAG CAG CCA CAG CAG GAG GAG 1422 
Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin Gin Glu Glu 
405 410 415 

GTG ACA TCA CCT GTC GTG CCC CCC TCT GTC AAG ACT CCG ACA CCT GAA 1470 
Val Thr Ser Pro Val Val Pro Pro Ser Val Lys Thr Pro Thr Pro Glu 
420 425 430 435 

CCA GCT GAG GTG GAG ACT CGC AAG GTG GTG CTG ATG CAG TGC AAC ATT 1513 
Pro Ala Glu Val Glu Thr Arg Lys Val Val Leu Met Gin Cys Asn He 
440 445 450 

GAG TCG GTG GAG GAG GGA GTC AAA CAC CAC CTG ACA CTT CTG CTG AAG 1566 
Glu Ser Val Glu Glu Gly Val Lys His His Leu Thr Leu Leu Leu Lys 
455 460 465 



TTG GAG GAC AAA CTG AAC CGG CAC CTG AGC TGT GAC CTG ATG CCA AAT 
Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu Met Pro Asn 
470 475 480 



1614 
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GAG AAT ATC CCC GAG TTG GCG GCT GAG CTG GTG CAG CTG GGC TTC ATT 
Glu Asn He Pro Glu Leu Ala Ala Glu Leu Val Gin Leu Gly Phe lie 
485 490 495 

AGT GAG GCT GAC CAG AGC CGG TTG ACT TCT CTG CTA GAA GAG ACC TTG 
Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu Glu Thr Leu 
500 505 510 515 

AAC AAG TTC AAT TTT GCC AGG AAC AGT ACC CTC AAC TCA GCC GCT GTC 
Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser Ala Ala Val 
520 525 530 

ACC GTC TCC TCT TAGAGCTCAC TCGGGCCAGG CCCTGATCTG CGCTGTGGCT 
Thr Val Ser Ser 
535 

GTCCCTGGAC GTGCTGCAGC CCTCCTGTCC CTTCCCCCCA GTCAGTATTA CCCTGTGAAG 
CCCCTTCCCT CCTTTATTAT TCAGGAGGGC TGGGGGGGCT CCCTGGTTCT GAGCATCATC 
CTTTCCCCTC CCCTCTCTTC CTCCCCTCTG CACTTTGTTT ACTTGTTTTG CACAGACGTG 
GGCCTGGGCC TTCTCAGCAG CCGCCTTCTA GTTGGGGGCT AGTCGCTGAT CTGCCGGCTC 
CCGCCCAGCC TGTGTGGAAA GGAGGCCCAC GGGCACTAGG GGAGCCGAAT TCTACAATCC 
CGCTGGGGCG GCCGGGGCGG GAGAGAAAGG TGGTGCTGCA GTGGTGGC CC TGGGGGGCCA 
TTCGATTCGC CTCAGTTGCT GCTGTAATAA AAGTCTACTT TTTGCTAAAA AAAAAAAAAA 
AAAAAAAAAA A 



(2) INFORMATION FOR SEQ ID NO: 10: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 amino acids 

(B) TYPE: amino acid 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Mec Ser Glu Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro 
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Lys Val Glu Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro 
20 25 30 

Pro Val Thr Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu 
35 40 45 

Ser Glu Asp Glu Ser Glu lie Leu Glu Glu Ser Pro Cys Gly Arg Trp 
50 55 60 

Gin Lys "Arg Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly lie Asp 
65 70 75 80 

Ser Ala Tyr Leu Ala Met Asp Thr Glu Glu Gly Val Glu val Val Trp 
85 90 95 

Asn Glu Val Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu 

100 105 . __j . no 

Lys Val Cys Ala Val Phe Asp Asn Leu lie Gin Leu Glu His Leu Asn 
115 120 125 

lie val Lys Phe His Lys Tyr Trp Ala Asp lie L^s Glu Asn Lys AJa 
130 135 U 

Arg val lie Phe lie Thr Gly Tyr Met Ser Ser Gli Ser Leu Lys Gin 
145 150 155 \ 160 

Phe Leu Lys Lys Thr Gin Lys Asn His Gin Thr Met isn Glu Lys Ala 
165 170 \ 175 



Trp Lys Arg Trp Cys Thr Gin He Leu Ser Ala Leu Set Tyr Leu His 



130 



185 



\190 



Ser Cys Asp Pro Pro He He His Gly Asn Leu Thr Cys Asp Thr He 

195 200 205 

Phe He Gin His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp 
210 215 220 



Thr lie Asn Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu 
225 230 235 240 



His Phe Phe Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala 
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245 250 255 

Val Asp lie Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu 
260 265 ' 270 

Glu lie Gin Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala lie 
275 280 285 

Ser Ser Ala He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe lie 
290 295 300 

Gin Lys Cys Leu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu 
305 310 315 320 

Leu Leu Phe His Pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu 
325 - 330 335 

Ala Ala His Cys He Val Gly His Gin His Met He Pro Glu Asn Ala 
340 345 350 

Leu Glu Glu lie Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu 
355 360 365 

He Pro Ala Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin 
370 ,375 380 

Ser Pro Ala Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly 
385 390 395. 400 

He Tyr Pro Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin 
405 410 415 

Gin Glu Glu Val Thr Ser Pro Val Val Pro Pro Ser val Lys Thr Pro 
420 425 430 

Thr Pro Glu Pro Ala Glu Val Glu Thr Arg Lys Val Val Leu Met Gin 
435 440 445 

cys Asn He Glu ser val Glu Glu Gly val Lys His His Leu Thr Leu 
450 455 460 

Leu Leu Lys Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu 
465 470 475 430 



Mec Pro Asn Glu Asn He Pro Glu Leu Ala Ala Glu Leu Val Gin Leu 
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485 490 495 

Gly Phe He Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu 
500 505 S10 

Glu Thr Leu Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser 
515 520 525 

Ala Ala Val Thr Val Ser Ser 
530 535 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GCACAGTCGA CCAAGCCGGA GTCGCAGAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



# 
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GC AC AAAGCT TGCCAGGAGG GGTCTGGCTG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

r\ 

Inj ( ii) MOLECULE TYPE: DNA • 



3 -3f 



(xi) SEQUENCE DESCRIPTION: .SEQ ID NO:13: 
GCACAACCAT GGCCAAGCCG GAGTCGCAGG AG 
(2) INFORMATION FOR SEQ ID NO: 14; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GCACAAGATC TCCAGGAGGG GTCTGGCTC 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 



Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro Cys 
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5 10 
(2) INFORMATION . FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glu Aap Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp 
5 10 15 

Cys 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 
{ B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Gly Tyr He Lye Glu Asp Glu Ala Leu Pro -Ser Pro His Thr Leu Gin Cys 
5 1C 15 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCACAGGTAC CGAGGCCATG GGCGCGCGC 



29 
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(2) INFORMATION FOR SEQ ID NO; 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCACATCTAG ATCAGTGGTG GTGGTGGTGG TGGACCGGCC CCAGGAGTGG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GCACAGCGGC CGCGAGGCCA TGGGCGCGCG C 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCACAGCGGC CGCTCAGTGG TGGTGGTGGT GGTGCCAGGA GGGGTCTGGC 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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( A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 
' (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTGACTTCCA TGCCATCCTT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GCTCACGACT CCAATCTGAT . 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24; 

Arg He Val Gly Gly 
5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 959 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2.. 856 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

C GAC CTA TTG TCA GGG CCC TGC GGT CAC AGG ACC ATC CCT TCC CGT 
Asp Leu Leu Ser Gly Pro Cys Gly His Arg Thr He Pro Ser Arg 
15 10 15 

ATA GTG GGT GGC GAT GAT GCT GAG CTT GGC CGC TGG CCG TGG CAA GGG 
He Val Gly Gly Asp Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly 
20 25 30 

AGC CTG CGT GTA TGG GGC AAC CAC TTA TGT GGC GCA ACC TTG CTC AAC 
Ser Leu Arg Val Trp Gly Asn His Leu Cys Gly Ala Thr Leu Leu Asn 
35 40 45 

CGC CGC TGG GTG CTT ACA GCT GCC CAC TGC TTC CAA AAG GAT AAC GAT 
Arg Arg Trp Val Leu Thr Ala Ala His Cys Phe Gin Lys Asp Asn Asp 
50 55 60 

CCT TTT GAC TGG ACA GTC CAG TTT GGT GAG CTG ACT TCC AGG CCA TCT 
Pro Phe Asp Trp Thr Val Gin Phe Gly Glu Leu Thr Ser Arg Pro Ser 
65 70 75 

CTC TGG AAC CTA CAG GCC TAT TCC AAC CGT TAC CAA ATA GAA GAT ATT 
Leu Trp Asn Leu Gin Ala Tyr Ser Asn Arg Tyr Gin He Glu Asp He 
80 85 90 95 

TTC CTG AGC CCC AAG TAC TCG GAG CAG TAT CCC AAT GAC ATA GCC CTG 
Phe Leu Ser Pro Lys Tyr Ser Glu Gin Tyr Pro Asn Asp He Ala Leu 
100 105 HO 



CTG AAG CTG TCA TCT CCA GTC ACC TAC AAT AAC TTC ATC CAG CCC ATC 

Leu Lys Leu Ser Ser Pro Val Thr Tyr Asn Asn Phe He Gin Pro He 
115 120 125 



S2< 



TGC CTC CTG AAC TCC ACG TAC AAG TTT GAG AAC CGA ACT GAC TGC TGG 
Cys Leu Leu Asn Ser Thr Tyr Lys Phe Glu Asn Arg Thr Asp Cys Trp 
130 135 140 



430 



GTG ACC GGC TGG GGG GCT ATT GGA GAA GAT GAG AGT CTG CCA TCT CCC 
Val Thr Gly Trp Gly Ala lis Gly Glu Asp Glu Ser Leu Pro Ser Pro 
145 150 155 



478 



b 



01 

ss 



.AAC ACT CTC CAG GAA GTG CAG GTA GCT ATT ATC AAC AAC AGC ATG TGT 526 
Asn Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys 
160 "165 170 175 

AAC CAT ATG TAC AAA AAG CCA GAC TTC CGC . ACG AAC ATC TGG GGA GAC 574 
Asn His Met Tyr Lys Lys Pro Asp Phe Arg Thr Asn He Trp Gly Asp 
ISO ' 185 190 

ATG GTT TGC GCT GGC ACT CCT GAA GGT GGC AAG GAT GCC TGC TTT GGT 622 
Met Val Cys Ala Gly Thr Pro Glu Gly Gly Lys Asp Ala Cys Phe Gly 
195 200 205 



o 



GAC TCG GGA GGA CCC TTG GCC TGC GAC CAG GAT ACG GTG TGG TAT CAG 
Asp Ser Gly Gly Pro Leu Ala Cys Asp Gin Asp Thr Val Trp Tyr Gin 
210 215 220 



670 



GTT GGA GTT GTG AGC TGG GGA ATA GGC TGT GGT CGC CCC AAT CGC CCT 
Val Gly Val Val Ser Trp Gly He Gly Cys Gly Arg Pro Asn Arg Pro 
225 ' 230 235 



718 



GGA GTC TAT ACC AAC ATC AGT CAT CAC TAC AAC TGG ATC CAG TCA ACC 
Gly Val Tyr Thr Asn He Ser His His Tyr Asn Trp He Gin Ser Thr 
240 245 250 255 



766 



ATG ATC CGC AAT GGG CTG CTC AGG CCT GAC CCA GTC CCC TTG CTA CTG 
Mec He Arg Asn Gly Leu Leu Arg Pro Asp Pro Val Pro Leu Leu Leu 
260 265 270 



B14 



TTT CTT ACT CTG GCC TGG GCT TCC TCT TTG CTG AGG CCT GCC 
Phe Leu Thr Leu Ala Trp Ala Ser Ser Leu Leu Arg Pro Ala 
275 280 285 



356 



TGAGCCCACA CGTGTACGTC ACACCTGTGA GGTCAGGGTG TGTCTCTTTT GTATCTTGCT 



916 



TGCTAATAAA CCTGTTAATA TTTAAAAAAA AAAAAAAAAA AAA 



959 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: " 

(Al LENGTH: 285 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Asp Leu Leu Ser Gly Pro Cys Gly His Arg Thr lie Pro Ser Arg lie 
15 10 IS 

Val Gly Gly Asp Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser 
20 25 30 

Leu Arg Val Trp Gly Asn His Leu Cys Gly Ala Thr Leu Leu Asn Arg 
35 40 45 

Arg Trp Val Leu Thr Ala Ala His Cys Phe Gin Lys Asp Asn Asp Pro 
50 55 60 

Phe Asp Trp Thr Val Gin Phe Gly Glu Leu Thr Ser Arg Pro Ser Leu 
65 ?0 75 80 

Trp Asn Leu Gin Ala Tyr Ser Asn Arg Tyr Gin lie Glu Asp He Phe 
85 90 9S 

Leu Ser Pro Lys Tyr Ser Glu Gin Tyr Pro Asn Asp He Ala Leu Leu 
100 105 110 

Lys Leu Ser Ser Pro Val Thr Tyr Asn Asn Phe He Gin Pro He Cys 
115 120 125 

Leu Leu Asn Ser Thr Tyr Lys Phe Glu Asn Arg Thr Asp Cys Trp Val 
130 135 140 

Thr Gly Trp Gly Ala He Gly Glu Asp Glu Ser Leu Pro Ser Pro Asn 
145 150 155 160 

Thr Leu Gin Glu Val Gin val Ala He He Asn Asn Ser Met Cys Asn 
165 170 175 

His Met Tyr Lys Lys Pro Asp Phe Arg Thr Asn He Trp Gly Asp Met 
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180 185 190 

Val Cys Ala Gly Thr Pro Glu Gly Gly Lys Asp Ala Cys Phe Gly Asp 
195 200 205 

Ser Gly Gly Pro Leu Ala Cys Asp Gin Asp Thr Val Trp Tyr Gin Val 
210 215 220 

Gly Val val Ser Trp Gly He Gly Cys Gly Arg Pro Asn Arg Pro Gly 
\* 225 230 235 240 

□ val Tyr Thr Asn He Ser His His Tyr Asn Trp He Gin Ser Thr Met 

j* 245 250 255 

ff? He Arg Asn Gly Leu Leu Arg Pro Asp Pro Val Pro Leu Leu Leu Phe 

11 260 265 270 

s Leu Thr Leu Ala Trp Ala Ser Ser Leu Leu Arg Pro Ala 

|=| 275 280 285 

\l (2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 : 

AGTGAGTCTC CTGCCTCAGC CTCCCAAGTA GCTGGGACTT CAGGTGTGTG CCACCATCCT 60 

CAGCTAATTT TTTTTTTTTT TTTTTTTTTG AGAAGGAGTC TTGCTCTGTC GCCCAGGCTG 12 0 

GAGTGCAGTG GCGCGATCTT CCAGGCCCCA CCGGGCCCTC AGGAAGGCCT TGCCTACCTG 180 

CTTTAAGGGG ACTCCTGGCT CAGGGCCAGG CCCCTGGTGC TGGAGGAGGT GGTGGGTGGA 240 

GGGCAGGGGG CACCAAGCGG GCAGCCAGGA CCCCCGGGCT GCAGACAAGA AAAGGACTGT 3 00 
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GGG GTCCACC GGGTCTGGGC CACATCAAGG AATGTGGTTG AAGACCCGCC CTTAGGAGCT 3 60 

GAAAGCCAGG GCGCTACCAG GCCTGAGAGG CCCCAAACAG CCCTTGGGCC TGGTTTGGGA 420 

: GGATTAAGCT GGAGCTCCCA ACCCGCCCTG CCCCCAGGGG GCGACCCCGG GCCCGGCGCG 480 

AGAGGAGGCA GAGGGGGCGT CAGGCCGCGG GAGAGGAGGC CATGGGCGCG CGCGGGGCGC 540 
TGCTGCTGGC GCTGCTGCTG GCTCGGGCTG GACTCAGGAA GCCGGGTGAG CTCGGGGCGC 600 

|== TGCTGGCGGG ATGGGCAGGC GGGGGAGCGG TGGGGAGGAC GGGAGGTGGA GGCCGCGGGG 660 

P 

C5 AGTCACTTCT TGTCTCCCGC AGAGTCGCAG GAGGCGGCGC CGTTATCAGG TAGGGCGCCC 720 

O AGGACGCGCG ATTCCTGCCA GGGCCGTTGG GCCGAGGTGG ACGGGGGGCG GTGAGGGGGT 7 30 

Cn 

41 AGAGGGGGGC CTTTACTGCT CTCTCGCCCC CGCCCCCGGG ATCGAGAACT CTGTTGGCGT 840 

Z. 3 

5 GGAAAGTAAC TAACGGACGC TGGAGGGGGA TGGGCGGGCC CTGCAGAGCA CGTGGGAGGA 900 

M TCTCCAGTGT CACCTACTTC CTGCTGCACA CACGCGAGGG GACCCTGGGT GGGCAAAAAC 960 

S\ GTGCTTTCCC GGACGGGGTT GAAGGGGAGA AAGGGAGAGG TCGGGCTTGG GGGGCTGCCT 1020 

f|| CCCGCGGCTC AGCAGTTCCT CTGACCATCC GAGGACCATG CGGCCGACGG GTCATCACGT 10 80 

CGCGCATCGT GGGTGGAGAG GACGCCGAAC TCGGGCGTTG GCCGTGGCAG GGGAGCCTGC 1140 

GCCTGTGGGA TTCCCACGTA TGCGGAGTGA GCCTGCTCAG CCACCGCTGG GCACTCACGG 1200 

CGGCGCACTG CTTTGAAACG TGAGTGG GGG TGCGAACGGA GGGGTGCGGG GACGGGCAGG 12 60 

AACAGGGCTG GAGGGAGTGC CACCGAACTT TACCTCTGGT CTGATGCCAG ACTTGGGCGT 1320 

GAAAGTTGTG CGTGGATGCG GCCTGGTGTT CTCCTGAGCC CCAGGCTGTG CTGCAGCCGG 13 30 

TTACACCCAC TCCAGTTCCC TTTGGGTCTC CTGGAGGGAA CCCTGTTCAG GTTATTCCAG 1440 

AATGTTCTTC CAGAACATTT CCACACACTT TTGGGTATTC TCTCCCTTTT TCTTTCAACC 1500 

CAAAGTTCAC CACTGACCAT CCCACCCTCA TCCCCCCTCC TGGTGGACGG TGCGGTACAG 1560 

TGTGGGGCAC TGAGCCAAGG CCAGCACCCC CGGGCCGCTG TGTGGACTCC ATCCTGCCAA 1620 

TCCCACMTG GCGTGGTGCA TCTCCCCATT CC?CCTTGGG CTGCATGGGG GTGCCCCTGG 1680 
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AGGCCTTGGC TCAATGCAAG GCTCCTTGGG ACAGCTCTGG GAGGTGACAA GACCCCACCC 1740 

TTCTGCTGCA GGAGCAGGTC CTAGGACTTT GGTTGTGGTC TGTCTGGGCT CCTTCATTTC 1800 

TGCAGGGGAC CCTGGGTGTT AGCAAGTAGC AGCAACACCA CAGTTTCCCC TCCTGCACTG 1860 

GACCCCAGTT GTGCTCAGGT AGCCAGCCCT CCATCCAGGG CCCCTGACTG CTCTCTTCTC 1920 

TTCTGCCAGC TATAGTGACC TTAGTGATCC CTCCGGGTGG ATGGTCCAGT TTGGCCAGCT 1980 

GACTTCCATG CCATCCTTCT GGAGCCTGCA GGCCTACTAC ACCCGTTACT TCGTATC GAA 2040 

JJSS. 

O TATCTATCTG AGCCCTCGCT ACCTGGGGAA TTCACCCTAT GACATTGCCT TGGTGAAGCT" 2100 

as 

"~r** , 

£3 GTCTGCACCT GTCACCTACA CTAAACACAT CCAGCCCATC TGTCTCCAGG CCTCCACATT 2 ISO 

Cfl 

41 TGAGTTTGAG AACCGGACAG ACTGCTGGGT GACTGGCTGG GGGTACATCA AAGAGGATGA 2220 

SI 

3 GGGTGAGGCT GGGGACAGGC GGGTCAGGGA GGAACTGTCT TTGTTCACCT GTTCCCCTGC 2280 

jUi ATAGGCACAA TAGCCCCCTG CTTGGTCTGG GGGTGCAGGC TATGCCCCTC TTGCTTGCAG 234 0 

1=1 

SI TCTCTCCTCA CCTGCCAGGG CAGGGACCAA ACACCCAGTT CTCTCCCTTC CAGGGGCTGT 2400 

GGGGGCCAGA AGGAGAGTGT GAGAGGGAGG CCAGTTTGGC GCAAGCCTGT GGGTGGTGCG 2460 

GTGGTGGAGG GGTTCTGGAG GGCTTGGCGA CATAAACCTC ATACTTGGAT TTATTCCTGC 2520 

ATCTTTCCAC CTCCCCCAGT GCTCACCAAT GCCCCAGGCA TCACCAGGTT GCCCCTTCCC 25e0 

CCAAGGTCTG GCTTTGGATG CTTATGTGAA CACCGTTTTA AGTTGCCTTG GCCCCTTCCT 2640 

CGGTTCCTTT TTGGCTGAGG AATCTCTCCA TGGCTGCAGG CAGGGCCATT GTTGCCATTC 2700 

TACAGATAGG GAAAGTGCGG CTGGGGGAGC TCTGACAGCT GTCCCTCCCC GGGGCCTTCT 2760 

GTGATGCTGC TGAGGGCCTC TGTTGTGCTG GGGTCTGGGT TGGAGCTGGG GGTAATGGAG 2 820 

ATGAACCTGC CAGGCACAGT GGGTGCCCCA GGGCCCCCAC CCCCGCAGCC TATGCCATCC 2 8 SO 

CTCCATAGAG GGGCCTCAGG TTGCTGTCTC TCTCCTTCCC ACTATCGTCC GCACAGCACT 2940 

GCCATCTCCC CACACCCTCC AGGAAGTTCA GGTCGCCATC ATAAACAACT CTATGTCCAA 3000 

CCACCTCTTC GTCAAGTACA GTTTCCGCAA GGACATCTTT GGAGACATGG TTTGTGCTGG 3 0 60 
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CAATGCCCAA GGCGGGAAGG ATGCCTGCTT CGTGAGTGTC CTTGCCACCA CTCCCAGCCC 3120 

AGGAAAGCAT CCTGTGTCCC TGTGCCTTAT TTGACCCTCA TGCCAACCCC GGGAGGTGGA 3180 

GACTGTTGCC CCACTCTGCA GATGCAGAAA CGGAGGCTTG GCTGCTGCCA GGGGGAGGAG 3240 

GAGGATGTGC ACCCAGTCTA CCCAGCCCCA TAGCCCTTCC CACTCTCAGC CCCTCCCCTG 3300 

CCCCACTCAC TCTGCCCCAG GCTGACCTCA GCCCCGCTGC TCCCCAGGGT GACTCAGGTG 33 60 

GACCCTTGGC CTGTAACAAG AATGGAC TGT GGTATCAGAT TGGAGTCGTG AGCTGGGGAG 342 0 

u 

*jl TGGGCTGTGG TCGGCCCAAT CGGCCCGGTG TCTACACCAA TATCAGCCAC CACTTTGAGT 3480 

=== 

'III GGATCCAGAA GCTGATGGCC CAGAGTGGCA TGTCCCAGCC AGACCCCTCC TGGCCGCTAC 3540 

in 

SB 

7] TCTTTTTCCC TCTTCTCTGG GCTCTCCCAC TCCTGGGGCC GGTCTGAGCC TACCTGAGCC 3 600 

!L CATGCAGCCT GGGGCCACTG CCAAGTCAGG CCCTGGTTCT CTTCTGTCTT GTTTGGTAAT 3660 

U 

U % AAACACATTC CAGTTGATGC CTTGCAGGGC ATTCTTCAAA AGCAGTGGCT TCATGGACAG 3720 

% 4 

CTCATTCTCT CTTGTGCAGA CAGCCTGTCT GTGCCCCTGG CTCACACCCA CATCTGTTCT 3780 



fil 



GCACCATAGA ACCATCTGGT TATTTCGATC AGAAAGAGAA TTGTGTGTTG CCCAGGCTGG 3340 
TCTTGAACGC CTAGGGTGTC TCGATC 3866 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:28: 
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CTGAACCGGG TTGTGGGCGG CGAGGACAGC ACTGACAGCG AGTGGCCCTG GATCGTGAGC 
ATC CAGAAGA ATGGGACCCA CCACTGCGCA GGTTCTCTGC TCACCAGCCG CTGGGTGATC 
ACTGCTGCCC ACTGTTTCAA GG AC AACCTG AACAAACCAT ACCTGTTCTC TGTGCTGCTG 
GGGGCCTGGC AGCTGGGGAA CCCTGGCTCT CGGTCCCAGA AGGTGGGTGT TGCCTGGGTG 
GAGCCCCACC CTGTGTATTC CTGGAAGGAA GGTGCCTGTG CAGACATTGC CCTGGTGCGT 
CTCGAGCGCT CCATACAGTT. CTCAGAGCGG GTCCTGCCCA TCTGCCTACC TGATGCCTCT 
ATCCACCTCC CTCCAAACAC CCACTGCTGG ATCTCAGGCT GGGGGAGCAT CCAAGATGGA 
GTTCCCTTGC CCCACCCTCA GACCCTGCAG AAGCTGAAGG TTCCTATCAT CGACTCGGAA 
GTCTGCAGCC ATCTGTACTG GCGGGGAGCA GGACAGGGAC CC ATC AC TG A GGACATGCTG 
TGTGCCGGCT ACTTGGAGGG GGAGCGGGAT GCTTGTCTGG GCGACTCCGG GGGCCCCCTC 
ATGTGCCAGG TGGACCGCGC CTGGCTGCTG GCCGGCATCA TCAGCTGCGG CGAGGGCTGT 
GCCGAGCGCA ACAGGCCCGG GGTCTACATC AGCCTCTCTG CGCACCGCTC CTGGGTGGAG 
AAGATCGTGC AAGGGGTGCA GCTCCGCGGG CGCGCTCAGG GGGGTGGGGC CCTCAGGGCA 
CCGAGCCAGG GCTCTGGGGC CGCCGCGCGC TCCTAGGGCG CAGCGGGACG CGGGGCTCGG 
ATCTGAAAGG CGGCCAGATC CACATCTGGA TCTGGATCTG CGGCGGCCTC GGGCG GTTTC 
CCCCGCCGTA AATAGGCTCA TCTACCTCTA CCTCTGGGGG CCCGGACGGC TGCTGCGGAA 
AGGAAACCCC CTCCCCGACC CGCCCGACGG CCTCAGGCCC CGCCTCCAAG GCATCAGGCC 
CCGCCCAACG GCCTCATGTC CCCGCCCCCA CGACTTCCGG CCCCGCCCCG GGCCCCAGCG 
CTTTTGTGTA TATAAATGTT AATGATTTTT ATAGGTATTT GTAACCCTGC CCACATATCT 
TATTTATTCC TCCAATTTCA AT AAA 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 3 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AATGCGGCCA CTCCAAGGAG GCCGGGAGGA TTGTGGGAGG CCAAGACACC CAGGAAGGAC 
GCTGGCCGTG GCAGGTTGGC CTGTGGTTGA CCTCAGTGGG GCATGTATGT GGGGGCTCCC 
TCATCCACCC ACGCTGGGTG CTCACAGCCG CCCACTGCTT CCTGAGGTCT GAGGATCCCG 
GGCTCTACCA TGTTAAAGTC GGAGGGCTGA CACCCTCACT TTCAGAGCCC CACTCGGCCT 
TGGTGGCTGT GAGGAGGCTC CTGGTCCACT CCTCATACCA TGGGACCACC ACCAGCGGGG 
ACATTGCCCT GATGGAGCTG GACTCCCCCT TGCAGGCCTC CCAGTTCAGC CCCATCTGCC 
TCCCAGGACC CCAGACCCCC CTCGCCATTG GGACCGTGTG CTGGGTAAAC GGGCTGGGGG 
TCCACTCAGG AGAGGCCCTG GCGAGTGTCC TTCAGGAGGT GGCTGTGCCC CTCCTGGACT 
CGAACATGTG TGAGCTGATG TACCACCTAG GAGAGCCC AG CCTGGCTGGC CAGCGCCTCA 
TCCAGGACGA CATGCTCTGT GCTGGCTCTG TCCAGGGCAA GAAAGACTCC TGCCAGGGTG 
ACTCCGGGGG GCCGCTGGTC TGCCCCATCA ATGATACGTG GATCCAGGCC GGCATTGTGA 
GCTGGGGATT CGGCTGTGCC CGGCCTTTCC GGCCTGGTGT CTACACCCAG GTGCTAAGCT 
ACACAGACTG GATTCAGAGA ACCCTGGCTG AATCTCACTC AGGCATGTCT GGGGCCCGCC 
CAGGTGCCCC AGGATCCCAC TCAGGCACCT CCAGATCCCA CCCAGTGCTG CTGCTTGAGC 
TGTTGACCGT ATGCTTGCTT GGGTCCCTGT GAACCATGAG CCATGGAGTC CGGGATCCCC 
TTTCTGGTAG GATTGATGGA ATCTAATAAT AAA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 980 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS: single ( 

(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CCTGTGGTCG CCCCAGGATG CTGAACCGAA TGGTGGGCGG GCAGGACACG CAGGAGGGCG 
AGTGGCCCTG GCAAGTCAGC ATCCAGCGCA ACGGAAGCCA CTTCTGCGGG GGCAGCCTCA 
TCGCGGAGCA GTGGGTCCTG ACGGCTGCGC ACTGCTTCCG CAACACCTCT GAGACGTCCC 
TGTACCAGGT CCTGCTGGGG GCAAGGCAGC TAGTGCAGCC GGGACCACAC GCTATGTATG 
CCCGGGTGAG GCAGGTGGAG AGCAACCCCC TGTACCAGGG CACGGCCTCC AGCGCTGACG 
TGGCCCTGGT GGAGCTGGAG GCACCAGTGC CCTTCACCAA TTACATCCTC CCCGTGTGCC 
TGCCTGACCC CTCGGTGATC TTTGAGACGG GCATGAACTG CTGGGTCACT GGCTGGGGCA 
GCCCCAGTGA GGAAGACCTC CTGCCCGAAC CGCGGATCCT GCAGAAACTC GCTGTGCCCA 
TCATCGACAC ACCCAAGTGC AACCTGCTCT ACAGCAAAGA CACCGAGTTT GGCTACCAAC 
CCAAAACCAT CAAGAATGAC ATGCTGTGCG CCGGCTTCGA GGAGGGCAAG AAGGATGCCT 
GCAAGGGCGA CTCGGGCGGC CCCCTGGTGT GCCTCGTGGG TCAGTCGTGG CTGCAGGCGG 
GGGTGATCAG CTGGGGTGAG GGCTGTGCCC GCCAGAACCG CCCAGGTGTC TACATCCGTn 
TCACCGCCCA CCACAACTGG ATCCATCGGA TCATCCCCAA ACTGCAGTTC CAGCCAGCGA 
GGTTGGGCGG CCAGAAGTGA GACCCCCGGG GCCAGGAGCC CCTTGAGCAG AGCTCTGCAC 
CCAGCCTGCC CGCCCACACC ATCCTGCTGG TCCTCCCAGC GCTGCTGTTG CACCTGTGAG 
CCCCACCAGA CTCATTTGTA AATAGCGCTC CTTCCTCCCC TCTCAAATAC CCTTATTTTA 
TTTATGTTTC TCCCAATAAA 



